InternVideo2-Chat-8B-InternLM2.5
MIT
InternVideo2-Chat-8B-InternLM2.5 is a video-text multimodal model that enhances video understanding and human-computer interaction by integrating the InternVideo2 video encoder with a large language model (LLM).
Video-to-Text
Safetensors